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Title of the I nvention 
5 Knowledge Tree Medical Enablement 

Field of the Invention 
The present invention relates to knowledge tree enablement within the 
medical field and more particularly but net exclusively to knowledge tree enablement 
10 in the field of studying side effect, of drugs, «ith particular reference to liver tonicity. 

Background of the Invention 
Pharmaceuticals are required to undergo a demanding set of clinical trials in 
ord,r to prove both that they work and that they work safely, that is without untoward 
IS *kh> effects. Also, different pharmaceutical, interact with each other so that certain- 
drug comWoaions cannot safely be prescribed. The interactions between, drugs are 
not widely understood and there is a tendency not to prescribe ift any case of doubt. 

CUnica. trials generally involve hrge numbers of patients and by their nature 
produce large amounts of data. Hie results of the trial are generally assessed and 
20 conclusions reached as to the safety and efficacy of the drug. 

However, analysis of large amount, of data is difficult. Each patient or 
volunteer participating in the trial has a medical history and it is difficult to spot 
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patterns in the data that relate., for example, to a particular item of medical history. 
Thus it is also known to use automatic methods of data mining., in which computers 
are used to find all the possible relationships within a dam set. Such on approach is 
unhelpful in the case of clinical trials since many trivial relationships are found. 
5 furthermore an enormous amount of processing power is required to run such data 
mining, and a large amount of human analysis is required afterwards to remove all of 
the trivial relationships and to concentrate on the important data. 



Summary of the Invention 

Q ID According to a f.rst aspect of tie present invention there is thus an automated 

modeler for modeling of an interactive system comprising at least ere biological 
entity and at least one pharmaceutical substance, the system comprising: 
a representation of states of the system, 

an input, associated mth the representation for allowing users to define at 
13 least one of, the states, expected relationships between the states and independent 
inputs to the states, 

a data miner ****** with the reputation to operate on data taken from 
the system to apply the data to the mm in accordance with the det.ned relationships 
and inputs, thereby u, apply numerical values to the relationships and the inputs, 
20 thereby to model the interactions. 

Preferably, the states include beneficial action* of the pharmaceutical 
substance. 
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, • iv ihe states include harmful actios of the 
Additionally or alternately, the states 

pham.aceutic&l substance. 

phannwal subsm™ will.it. OK 

plurality of pharmaceutical substances. 

and a plurality of biological entities. 

Preferably, the data is clinical trial data. 
0 Prsfaabiy the model is usable to direct a clinical trial. 

Prc ferably ; the model is usable to direct drug administration to a patient. 

the. model, and a third data set to test flu model. 

A cccr^ to . second aspect of the present invention there is provided an 

p _^ S uhst^^ 

a ..preseiitcliui 0 r sums of *c ■ppli--»ti™. 

„ h p» M »d,=a w* « «" — * — - *** 

. *« «n* - W *< *" » °^' e " *" 
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inputs, thereby to apply numerical values to the interactions and the inputs, to model 
the application. 

Preferably, the application comprises a plurality of pharmaceutical 

substances. 

5 According to a third aspect of the pressnt invention there is provided a 

system tor predicting likely liver toxicity as a side effect of application of a 
pharmaceutical substance, the system comprising: 

an input device for obtaining blood levels of ALT and AST respectively, 
& comparator, associated with the input device for comparing the respective 
1 0 levels of Alt and AST to produce a raric of the levels, and 

a predictor associated both with the input device and the comparator, for 
predicting, from the levels and the ratio therebetween, a likelihood of development of 
liver toxicity, 



q Preferably, the predictor is set to conclude from low ALT and AST levels 

^ 1 5 and a ratio close to 1 , that a likelihood of development of liver toxicity is low. 



Preferably, the predictor is set to conclude from high ALT and AST levels, 
that a likelihood of liver toxicity is relatively high. 

Preferably, the predictor is set to conclude from a ratio not close to .1, that a 
likelihood of liver toxicity is relatively high. 

Preferably, the predictor is set to conclude from a ratio close to 1, that a 
likelihood of liver toxicity is relatively low. 
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A preferred embodiment further comprises a threshold* for setting a 
threshold likelihood, above which application of the pharmaceutical substance is to be 
discontinued. 

According to a fourth aspect of the present invention there is provided a 
5 method for modeling an interaction between at least one biological system and at least 
one pharmaceutical substance, the method composing: 
building a state diagram of the interaction, 

entering at least one of inputs to the states and interactions between the 

states, 

1 o defining at least one output ftom at least one of the states, 

obtaining empirical data regarding the interaction, 

carrying out data mining on the empirical data to assign at least one of values 
to the relationship and functions to the states, thereby to obtain a quantitative model 
of the interaction. 
? 5 Preferably, the method comprises 

randomly dividing die empirical data set into at least two data sets, 
performing the data mining using only one of the sets, and 
testing the model using a remaining one of the sets to ensure that the data has 
not been overfitted. 

20 

Brief Description of the Drawings 
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For a better understanding of the invention and to show how the same may 
be carried into effect, reference will now be made, purely by way of example, to the 
accompanying drawings, in which: 

'Fig. 1 is a simplified diagram showing a modeling apparatus according to a 
5 first embodiment of the present invention, 

Fig. 2 is a qualitative model for studying liver tonicity for use in the 
embodiment of Fig. 1, 

Fig. 3 is an alternative qualitative modal tor studying liver toxicity for use in 
ihe embodiment of Fig. 1, 
. p fig. 4 i s a further node for addition to either of the models of Fig. 3 or Fig. 4 

□ 

H for extending the model to control of a clinical trial, 

9 Fig. SA is a graph showing the relationship between ALT and AST liver 

-<5 enzymes in the blood, 

Fig. SB is a diagram of a knowledge tree that incorporates a linear 
O <5 relationship between 'Ac ALT and AST enzymes as inputs leading to a diagnosis, and 

Q fig. 6 ,s 3 simplified flow chart showing a method of modeling an interaction 

between a biological entity and a pharmaceutical substance according to an 
embodiment of the present invention. 



Description of the Preferred Embodiments 
Reference Is now made to Fig. 1, which is a simplified schematic diagram of 
an automated modeler for modeling of an interactive system. An interactive system 
10 composes a biological entity and a pharmaceutical substance. Generally, it is of 
6 
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great interest to understand the interactions with™ the system, so as to allow for safe 
use of the pharmaceutical substance. The modeler b initially supplied with a state 
model 12 of the system 10. generally as a logical entity within a computer. The state 
model may be specific to the system being tested or it may be generalized for a certain 
5 tvpe of „a*n. Generally, me number of states i. small at least initially. As the 
problem is considered further it is generally bioten down into waller and smaller 
units. Th.s the state model has a tendency to grow in size and the number of states 
increa.es. Initially, at n> early stage of understanding it is often easier to work with a 
5 mall number of nodes each having a large number of inputs and outputs, but as the 
10 model is refined, a larger number of nodes each reflect more direct causal 

0 relationships between a smaller number of inputs and output. The nodes at this more 
S advanced stage are easier to analyze In terms of data mining and the system tends to 

Q 

f»g produce more accurate results. 

1 Each node Is preferably assigned a function that relates between inputs and 
f 15 c^t, At the start a presumed function may be used. For example a node may be 
0 defined for determining the quantity of bilirubin ia the blood. There are two types of 
Q bSirubin, direct bilirubin and indirect bilirubin, each of which i S measured separately. 
| Each of the two measurements are set as inputs to the node and a summing function is 
M set within the node to give an output indicative of the total amount of bilirubin. 

20 initially the taction is either guessed or is fitted from the test data during the 

data mining procedure, as will be discussed i, more detail below. The function is 
fen refined using empirical data in an operation of function fitting. The function is 
preferably refined with empirical data to steadily reduce the output variance. As will 
be discussed in more detail below, overling of the taction is preferably avoided . 
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The state mods! 12 comprises a representation of states of said system, but 
preferably dot* not initially include relationships therebetween, model inputs or 
quantities. 

The modeler further comprises knowledge tree builder 14, connected to the 
5 representation or state model 12, for allowing users to define states in the model and 
expected relationships between the states and independent inputs to the states. The 
idea is that experts., persons expected to know about the system, are selected and 
asked to define relationships between the states, and inputs to the states, that they 
believe to be significant. Again, at this stage, no attempt is made to quantify any of 
10 the relationships, The experts are simply asked to define relationships that they 
believe, in the light of their experience, to exist. 

In the praam case, suitable experts may include pharmaceutical researchers 
foliar with triais of similar drugs, and specialist medical practitioners familiar with 
the condition that the pharmaceutical substance currently under test is intended to 
(5 rreat. It could include patients involved in the trial if deemed appropriate. Generally 
it Is regarded as advantageous to apply as wide a range of experience to the input 
stage of the modeling as possible. 

The modeler preferably has a database 16 of quantified information 
regarding the *o»n to be modeled. Traditional modeling systems based on 
20 quantities of cte have attempted to find every possible- relationship within the data 
and have tended to produce unwieldy models and require processing resources out of 
all proportion to the data available and rising exponentially with increases in the size 
of the database. 



A data miner 18 preferably operates in association with the representation 
and with the database to apply values to the relationships. That is to say the model, 
with its expert-defined relationships, is used as a guide to the dm mining process. 
The data mining process is thereby restricted to the study of relationships regarded by 
experts as important or relevant and in effect, a dimension reduction is achieved in the 
modeling process. The result is a quantified mode! encompassing all the states and all 
the interactions therebetween that have been previously decided to be important. A 
further redaction in the model may then be carried out by removing relationships 
whose quantities are found to be below a threshold value. Alternatively or 
additionally, relationships or inputs whose effect on an output is found to be 
negligible, may be removed. 

In a further stage, if the model thus formed is found to be inadequate, the 
experts may be consulted again to find out if a relationship is missing. The 
quantification process may then U repeated with the new relationship. That is to say, 
when checking the data, the variance in the outputs is vety large. In such a c^e a new 
relationship is preferably added to the model. 

Returning » the question of function fining, in theory it is possible to 
produce a function of very high order which covers exactly every single data point in 
the input. Such a function however is viewed as of limited predictive ability because 
) it includes all die noise in the original data. Using functions of such high order is 
considered as overfittmg the function. In order to prevent overfitting it is possible to 
r5Ilt i om ly divide the da* into three groups and to use the first group for initial 
quantization of the mod* the second group for calibration of the model and the third 
group simply for testing the mode,. The dtird group is not used to refine the model 
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but simply to ensure that the variance around the third group is the same as with the 
other groups, thereby determining that overfitting of the data has not occurred. 

The system being modeled is generally the interaction between one or more 
biological entities and one or more pharmaceutical substances. A particular beneficial 
5 interaction may he the subject of the study or a particular harmful interaction, or the 
mode! may simply seek to mode! the interactions as a whole. Once the model has 
been created it may be used to predict possible interactions, including side effects, 
when administered to a patient. For example the model may contain a relationship 
between age and a particular side effect of the drug. The prescribing physician can 
10 therefore use the model to note that the danger of the side affect is enhanced and 
guide both himsel f and the patient as to particular effects to watch out for in particular 
| cases. The physician b also provided with reliable information on which to base a 

decision that a particular drug is too dangerous to prescribe for a given patient or that, 
canaiy » P«vw« belief, a certain drug is perfectly safe to give to a particular 
15 patient. The Utter case may result, for example, from the, ability of the model to relate 
a particular side effect to age or any other patient variable, which the physician is then 
able tu see is not appl icable to his current patient. 

The system may typically be used in a study of a single pharmaceutical 
substance used in numerous patients, that is to say its typical use is in modeling data 
20 from various stages of clinical trials. Subsequently the model may be used as a 
predictor in the use of the substance an individual patients. Of course the results of 
the treatment of the individual patient may then be added to the database for 
subsequent refinements of the model. 
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It is noted however, that the system may also be used for studying the use of 
a combination of drug-? on patients. Generally, in such a case, interactions are poorly 
understood, and prescribing rules for drug interactions generally assume the worst. 
That is to say : if there is a know* serious side effect of the particular drug 
5 combination then the physician will usually be led to avoid prescribing the 
combination in all circumstances. A predictive model that includes interactions 
between drugs may therefore serve as a means of allowing the prescription of a wider 
range of drug combinations. 

The predictive power of the model may be used to direct a clinical trial. For 
10 example d>» gathered up to a certain point may reveal that the drug is harmful to 
3 patients over a certain age, and thus the trial may be supped in respect of part 

| over that age but continued in respect of everyone else. Currently the trials are most 

Q likely to he stopped altogether. 

g A particularly preferred embodiment of the present mention uses a first set 

l , 5 of data to build the model and then runs the model on a second at of data to calibrate 

H Ac model. Prefer*** a third set of data if available, may be used to test the calibrated 



03 model. 
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Referent is now made to Fig, 2. which is a simplified diagram of a 
, v ,a.i,tive model for measuring liver toxicity effects of a drug- The model is the 
version aft,, expert input but before data mining. The model contains two nodes, a 
firs, node 22 *kes symptom., sign, and Moratory test results as input and produces 
m output of "type of hepatic dys«on". A second node 24 receives the 'type of 
hepatic dysfunction" a, an input, along with previous patient history, severity of 
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hepatic clinical function and serum levels of the drug, and produces an output of 
''potential for liver toxicity". 

The model of Fig. 2 is from the point of view of the individual patient, and 
indicates away of fixing the thought processes of the expert. The expert analyzes the 
5 patient clinical profile, which includes symptoms (subjective description of the 
patient's feelings) signs (objective marks on the patient's body) and laboratory test 
results, as described above. Using the patient's clinical profile the physician would 
normally decide on the diagnosis. More specifically, the physician may decide 
whether the patient has any hepatic dysftanction and refine the diagnosis concerning 
10 the livtir. The above process is embodied in the knowledge tree of Fig. 2. 

The model as described abovi is sufficient to guide a data miner utility to 



O create a formula for node 22 . 

a 

~-P Similarly node 24 includes four inputs as show., serum levels of the drag, 

P type cf hepatic dysfunction, severity and previous history. The output of box 24 is the 

Q 1 5 potential for liver toxicity of the specific drug. Following a similar analysis using a 

q data mining utility the pharmaceutical company or prescribing practitioner is supplied 

| with considerable information regarding the safety or efficacy of the drug. 

Beyond relationships within individual nodes, the model also supplies 
information about relationships between nodes. Thus the output of first node 22 can 
20 be used as an input to second node 24. The relationship between the wo nodes is that 
of cause and effect, which is to say that the output of the first node is one of the 
causes or inputs of the second node. I: is pointed out that a case and effect 
relationship can be examined in two ways, given a tause I can predict an effect, but 
additionally, given an effect I can use the tree to make assumptions about the cause. 
12 
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in the present case, moving from a final output of potential for liver toxicity or* can 
detenr.ine the clinical finding in the patient that warned of the toxicity. 

Reference is now made to Fig. 3, vhibh is a simpl ified diagram of a variation 
of the model of Fig. Z Part, that are the same as those fa, previous figures are given 
5 the same reference numerals and are not described again except 3S necessary for an 
understands of the present embodiment In the model of Fig. 3, a new node is 
introduced, of which level, of the drug" is an output. Thrse inputs of dosage, 

diction features and elimination features are defined as influences on the serum 
level. A progression is shown between the models of Fig, 2 «d 3 which progression 
,0 is typical of how a .odd progress as experts continue to deiibct* over it. 

Reference is now made to Fig. 4, which is a simplified diagram of an 
e^icn th* m» be made to eifher of *e mode.s of Fig. 2 or Fig. 3. to allow the 
mo del to be used in the augment of a clinical trial. The mod,, includes a single 
no de 2S which receives as an input the output "potential for liver toxicity" of the 

of* .ail, involved, the type of ,ive r .o,icit>, and the.everity of *c IWcrto,^. 
The bipws are for the clinical trial as a whole. 

A series of mv*t « *f.„ed as shown in the figure for marking *. ** 
„«d or. the information « -» <* * *"» *•** 

„ «, of IndiCions. new set of ^ » 

patient more closely, ana change environment. 

Reft^c is now made to *. JA. which is a W» • ■**"*» 

to , „ disco,ereo « S i» E *e system of the * h " " 

«*l for the «. of *e ,iver. b Fi, 5 * * *- 
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two liver enzymes, ALT and AST. Generally, levels of both ALT and AST arc 
relatively low in the bloodstream as the enzymes are retained in the liver, In the event 
of trauma within the liver, and especially in the event of larger than normal scale 
death of liver cells, some of the enzyme escapes into the Wood. Thus, the presence of 
> noticeable levels of either of the above enzymes has been used in the past as an 
indicate! of liver toxicity. Indeed in Fig. 5. the vast majority of points plotted are 
close to the origin of the era*. In applying the above embodiment, to the liver 
system it hypothesized that a linear relationship existed between the two 
enzymes. In any case a certain linearity can be seen in the graph. 
10 ^ ; ,odel found thai there is indeed a linear relationship between the two 

I substances, that the linearity Is strong in the case of healthy patients than in 

i panenu already known to be suffering from liver dysfunction. Thus, the potent,, for 

1 hver dysfunction may be predicted by monitoring, not only the overall level, of AST 

S and ALT, but also the extent of the linearity in the relationship *«eberwe e n. 

^ 15 For example a point numbered 33 located in the top right hand comer of the 

S ^h to high levels of both enzymes and would traditionally have been indicated a. 

° a sign of illness. Nevertheless the relationship is here shown to be linear, indicating 

g that the illness * not serious. By contest a point numbered 82 in the lower right part 

of the graph is very non-linear, indicating serious liver dysfunction. 
20 The four numbered points 1202, 10.9, 485 and 1664, around the periphery of 

tht lower loft region of the graph would traditionally have been regarded as being 
healthv on the ground, that neither enzyme is outside the limit. Hoover linearity is 
, le ar,y tow and the present embodiment preferably signals the presence of Hver 
disorder. 
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The above prediction may be incorporated into a model as described with 
respect to Fig. 4 for conducting a clinical trial. It .nay also be incorporated into an 
individual patient model of the kind shown in Figs. 2 and 3. The model may be 
connected directly to an input device for reading in AST and ALT levels. Once the 
5 model has been quantized il will generally include a node whose outputs are defined 
so that they effectively compare the levels of the two drugs. If departure from 
linearity is detected then a diagnosis of iiver disorder may be given. A knowledge 
tree which includes the enzyme levels as inputs and gives a diagnosis as an output is 
stioivn in Fig. 513. Generally, the quantifying stage may find new relationships 
LU between variables or attach greater or lesser importance to previously known or 
suspected relationships, which can now be incorporated into new models. 

It is also possible to build a dedicated device for applying to a patient to 
obtain an indication of liver toxicity. The dedicated device preferably comprises an 
input device for measuring AST and ALT level* and a processor includes comparison 
IS circu.try for producing a ratio. The processor preferably also includes analysis, or 
predictor, functionality which uses either or both of the absolute values and the ratio 
between to decide on the potential for liver toxicity, For example the predictor may 
b* set to conclude from low ALT and AST levels and departure from the linear 
relationship, that a likelihood of development of liver toxicity is low. Likewise the 
20 predictor may be set to conclude from high ALT and AST levels, that a likelihcod of 
iter toxicity is relatively high. Again, the predictor may be set to conclude from 
departure from linearity, without reference to absolute levels, that a likelihood of liver 
toxicity is relatively high. Likewise, the predictor may be set to conclude from a ratio 
close to linearity, without reference to absolute levels, that a likelihood of liver 
25 toxicity is relatively low. In a preferred embodiment, a threshotder may be included 
15 
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in the predictor for setting a threshold likelihood, above which application of the 
pharmaceutical substance is to be discontinued as being too dangerous. The threshold 
is preferably altered for different substances and for different ailments. For example a 
drug being used in a lifcthreatenirg ailment is preferably signed a bigb threshold 
5 for discontinuity whereas a drug thet is being used for a chronic but low level 
condition may be given a lower threshold for discontinuity. 

Reference is now made to Fig. 6, which is a simplified flow chart describing 
a method of modeling an interaction between a phamacaatical sttbstar.ee and a 
biologic! system according to a preferred embodiment of the present invention. 
10 In Fie- 6 a first stage of arranging a state diagram of the interaction to be 

modeled is carried out. Then a stage of adding interactions between the state, and 
inputs to the flat* is carried out. At least one output is also preferably defined. The 
above stages are preferably carried out using a kno^dge «, builder as described 
above. A date set. preferably including empirical data is taken to feed a data mining 
,5 procedure which studies the data for the interactions indicated in the model. A 
quantified model of the interaction is thereby attained. Standard experimental 
procedure provides that the data is divided into three, preferably at random. One of 
5 U)e d(U3 S et 5 is then u«d for data mining, one of the data sets is used for calibration 

and one of the data sets is used for testing the data to ensure that overling of the 
20 data has not occurred, 

,!t is appreciated that certain features of the invention, which are, for clarity, 
described in the context of separate embodiments, may also be provided in 
combination in a single embodiment. Conversely, various features of the invention 
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which are, for brevity, described in the context of a single embodiment, may also be 
provided separately or in any suitable subcombination. 

It will be appreciated by persons skilled in the art that the present invention is 
not limited to what has been particularly shown and described hereinabove. Rather 
5 the scope of the present invention is defined by the appended claims and includes both 
combinations and subcombinations of the various features described hereinabove as 
well as variations and modifications thereof which would occur to parsons skilled in 
the art upon reading the foregoing description. 
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